Why choose Random Forest to predict rare species distribution with few samples in large undersampled areas? Three Asian crane species models provide supporting evidence
Authors
Abstract
Species distribution models (SDMs) have become an essential tool in ecology, biogeography, evolution and, more recently, conservation biology. How to generalize species distributions across large undersampled areas, especially from few samples, is a fundamental issue for SDMs. To explore this issue, we used the best available presence records for the Hooded Crane (Grus monacha, n = 33), White-naped Crane (Grus vipio, n = 40), and Black-necked Crane (Grus nigricollis, n = 75) in China as three case studies. We employed four powerful and commonly used machine learning algorithms to map the breeding distributions of the three species: TreeNet (Stochastic Gradient Boosting, Boosted Regression Tree Model), Random Forest, CART (Classification and Regression Tree) and Maxent (Maximum Entropy Models). In addition, we developed an ensemble forecast by averaging the predicted probabilities of the four models. Two commonly used performance metrics, the area under the ROC curve (AUC) and the true skill statistic (TSS), were employed to evaluate model accuracy. The latest satellite tracking data and compiled literature data were used as two independent testing datasets to test model predictions. We found that Random Forest performed best on most assessment metrics, provided a better fit to the testing data, and produced better species range maps for each crane species in undersampled areas. Random Forest has been generally available for more than 20 years and is known to perform extremely well in ecological predictions. However, although its use is increasing, its potential remains widely underused in conservation, (spatial) ecological applications and inference. Our results show that it informs ecological and biogeographical theories and is suitable for conservation applications, specifically when the study area is undersampled. This method helps to save model-selection time and effort, and allows robust and rapid assessments and decisions for efficient conservation.
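The evaluation steps named in the abstract (an ensemble formed by averaging the four models' predicted probabilities, then scoring with AUC and TSS) can be illustrated with a minimal sketch. This is not the authors' code: the probabilities and test labels below are invented toy values, the function names are hypothetical, and the 0.5 threshold used for TSS is an assumption, since the abstract does not state how predictions were thresholded.

```python
from statistics import mean


def ensemble_mean(prob_lists):
    """Unweighted average of per-site probabilities across models."""
    return [mean(site) for site in zip(*prob_lists)]


def auc(y_true, y_prob):
    """AUC as the chance that a presence site outranks an absence site
    (ties count as half a win)."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def tss(y_true, y_prob, threshold=0.5):
    """True skill statistic = sensitivity + specificity - 1.
    The default threshold is an assumption made for this sketch."""
    pairs = list(zip(y_true, y_prob))
    tp = sum(1 for y, p in pairs if y == 1 and p >= threshold)
    fn = sum(1 for y, p in pairs if y == 1 and p < threshold)
    tn = sum(1 for y, p in pairs if y == 0 and p < threshold)
    fp = sum(1 for y, p in pairs if y == 0 and p >= threshold)
    return tp / (tp + fn) + tn / (tn + fp) - 1


# Toy independent test set: 1 = presence, 0 = absence (invented values).
y_true = [1, 1, 1, 0, 0, 0]

# Invented predicted probabilities for the same six sites, one list per model.
model_probs = {
    "treenet":       [0.9, 0.6, 0.4, 0.3, 0.2, 0.1],
    "random_forest": [0.8, 0.7, 0.6, 0.4, 0.3, 0.2],
    "cart":          [1.0, 1.0, 0.0, 0.0, 1.0, 0.0],
    "maxent":        [0.7, 0.5, 0.8, 0.5, 0.1, 0.2],
}
model_probs["ensemble"] = ensemble_mean(list(model_probs.values()))

for name, probs in model_probs.items():
    print(f"{name:14s} AUC={auc(y_true, probs):.3f} TSS={tss(y_true, probs):.3f}")
```

AUC is threshold-independent, whereas TSS depends on the cut-off chosen, so the two metrics can rank models differently on the same test data.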
Similar resources
Climate change would enlarge suitable planting areas of sugarcanes in China
China’s sugar production and consumption continues to increase. This process is already ongoing for over 15 years and over 90% of the sugar production comes from sugarcane (Saccharum officinarum). Most of the sugarcane is planted in the south (e.g. the Chinese provinces of Yunnan, Guangxi, Guangdong and Hainan) and it represents there a major economic crop in these landscapes. As found virtually wo...
Predicting the geographical distribution of Alopecurus textilis Boiss rangeland species on basis Consensus approach of climate change in Mazandaran province
Climate change plays an important role in the distribution of plant species. Statistical species distribution models (SDMs) are widely used to predict changes in species distribution under climate change scenarios. In the present study, the distribution of Alopecurus textilis in the current and future climate condition (2050) under the influence of climate change and two scenarios of RCP 4...
Comparing Different Modeling Techniques for Predicting Presence-absence of Some Dominant Plant Species in Mountain Rangelands, Mazandaran Province
In applied studies, the investigation of the relationship between a plant species and environmental variables is essential to manage ecological problems and rangeland ecosystems. This research was conducted in summer 2016. The aim of this study was to compare the predictive power of a number of Species Distribution Models (SDMs) and to evaluate the importance of a range of environmental variabl...
Predicting the Climatic Ecological Niche of Artemisia aucheri Boiss in Central Iran using Species Distribution Modeling
Changes in the geographical distribution of plants are one of the major impacts of climate change. This study aimed to predict the potential changes in the distribution of Artemisia aucheri Boiss in Isfahan rangelands. Therefore, six bioclimatic variables and two physiographic variables were used under the Generalized Linear Model (GLM), Flexible Denotative Analysis (FDA), Surface Range...
Effect of environmental factors on the distribution of suitable habitats of wintering populations of Asian houbara in the central plateau of Iran
Predicting species’ distribution is a prerequisite for assessing threats, determining conservation status, and planning conservation programs. Asian houbara Chlamydotis macqueenii is one of the most valuable game species threatened by extinction. We estimated the distribution of potential suitable habitats of wintering populations of Asian houbara bustard in the central Iranian plateau using maximum ...